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(57) ABSTRACT 

A subscriber characterization system is presented in which 
the subscriber's requests are transmitted to a server which 
fulfills those requests and performs monitoring of the sub- 
scriber requests for subsequent characterization of the sub- 
scriber. Monitoring includes maintaining records of the time 
duration programming is watched, the volume at which the 
programming is listened to, and any available information 
regarding the type of programming, including category and 
sub-category of the programming. The characterization sys- 
tem works across a network to extract textual information 
related to the programming from closed captioning data, 
electronic program guides, or other text sources associated 
with the programming. The extracted information is used to 
form program characteristics vectors. The programming 
characteristics vectors can be used in combination with the 
subscriber selection data to form a subscriber profile. Heu- 
ristic rules indicating the relationships between program- 
ming choices and demographics can be applied to generate 
additional probabilistic information regarding demographics 
and programming and product interests. The probabilistic 
information can be accessed at the server by other entities on 
the network. 
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CLIENT-SERVER BASED SUBSCRIBER 
CHARACTERIZATION SYSTEM 

BACKGROUND OF THE INVENTION 

Subscribers face an increasingly large number of choices 
for entertainment programming, which is delivered over 
networks such as cable TV systems, over-the-air broadcast 
systems, and switched digital access systems which use 
telephone company twisted wire pairs for the deliver of 
signals. 

Cable television service providers have typically provided 
one-way broadcast services but now offer high-speed data 
services and can combine traditional analog broadcasts with 
digital broadcasts and access to Internet web sites. Tele- 
phone companies can offer digital data and video program- 
ming on a switched basis over digital subscriber line tech- 
nology. Although the subscriber may only be presented with 
one channel at a time, channel change requests are instan- 
taneously transmitted to centralized switching equipment 
and the subscriber can access the programming in a 
broadcast-like manner. Internet Service Providers (ISPs) 
offer Internet access and can offer access to text, audio, and 
video programming which can also be delivered in a 
broadcast-like manner in which the subscriber selects "chan- 
nels" containing programming of interest. Such channels 
may be offered as part or a video programming service or 
within a data service and can be presented within an Internet 
browser. 

Along with the multitude of programming choices which 
the subscriber faces, subscribers are subject to 
advertisements, which in many cases subsidize or pay for the 
entire cost of the programming. While advertisements are 
sometimes beneficial to subscribers and deliver desired 
information regarding specific products or service, consum- 
ers generally vie advertising as a "necessary evil" for 
broadcast type entertainment. 

In order to deliver more targeted programming and adver- 
tising to subscribers, it is necessary to understand their likes 
and dislikes to a greater extent than is presently done today. 
Systems which identify subscriber preferences based on 
their purchases and responses to questionnaires allow for the 
targeted marketing of literature in the mail, but do not in any 
sense allow for the rapid and precise delivery of program- 
ming and advertising which is known to have a high 
probability of acceptance to the subscriber. In order to 
determine which programming or advertising is appropriate 
for the subscriber, knowledge of that subscriber and the 
subscriber product and programming preferences is 
required. 

Specific information regarding a subscriber's viewing 
habits or the Internet web sites they have accessed can be 
stored for analysis, but such records are considered private 
and subscribers are not generally willing to have such 
information leave their control. Although there arc regula- 
tory models which permit the collection of such data on a 
"notice and consent" basis, there is a general tendency 
towards legal rules which prohibit such raw data to be 
collected. 

With the migration of services from a broadcast based 
model to a client-server based model in which subscribers 
make individualized request for programming to an Internet 
access provider or content provider, there is opportunity to 
monitor the subscriber viewing characteristics to better 
provide them with programming and advertising which will 
be of interest to them. A server may act as a proxy for the 
subscriber requests and thus be able to monitor what a 
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subscriber has requested and is viewing. Since subscribers 
may not want this raw data to be utilized, there is a need for 
a system which can process this information and generate 
statistically relevant subscriber profiles. These profiles 

5 should be accessible to others on the network who may wish 
to determine if their programming or advertisements are 
suitable for the subscriber. 

For the foregoing reasons, there is a need for a subscriber 
characterization system which can generate and store sub- 

io scriber characteristics which reflect the probable demo- 
graphics and preferences of the subscriber and household. 

SUMMARY OF THE INVENTION 

The present invention includes a system for characterizing 

15 subscribers watching video or multimedia programming 
based on monitoring the requests made by the subscriber for 
programming to a server which contains the content or 
which requests the content from a third party. The server side 
of the network is able to monitor the subscriber's detailed 

20 selection choices including the time duration of their 
viewing, the volume the programming is listened at, and the 
program selection. 

The server side collects text information about that pro- 
gramming to determine what type of programming the 

25 subscriber is most interested in. In addition the system can 
generate a demographic description of the subscriber or 
household which describes the probable age, income, gender 
and other demographics. The resulting characterization 
includes probabilistic determinations of what other pro- 

30 gramming or products the subscriber/household will be 
interested in. 

In a preferred embodiment the textual information which 
describes the programming is obtained by context mining of 

35 text associated with the programming. The associated text 
can be from the closed-captioning data associated with the 
programming, an electronic program guide, or from text files 
associated with or part of the programming itself. 
The system can provide both session measurements which 

4Q correspond to a profile obtained over a viewing session, or 
an average profile which corresponds to data obtained over 
multiple viewing sessions. 

The present invention also encompasses the use of heu- 
ristic rules in logical form or expressed as conditional 

45 probabilities to aid in forming a subscriber profile. The 
heuristic rules in logical form allow the system to apply 
generalizations which have been learned from external stud- 
ies to obtain a characterization of the subscriber. In the case 
of conditional probabilities, determinations of the probable 

50 content of a program can be applied in a mathematical step 
to a matrix of conditional probabilities to obtain probabilis- 
tic subscriber profiles indicating program and product likes 
and dislikes as well for determining probabilistic demo- 
graphic data. 

55 One advantage of the present invention is that it allows 
consumers the possibility to permit access to probabilistic 
information regarding their household demographics and 
programming/product preferences, without revealing their 
specific viewing history. Subscribers may elect to permit 

60 access to this information in order to receive advertising 
which is more targeted to their likes/dislikes. Similarly, a 
subscriber may wish to sell access to this statistical data in 
order to receive revenue or receive a discount on a product 
or a service. 

65 Another advantage of the present invention is that the 
resulting probabilistic information can be stored locally and 
controlled by the subscriber, or can be transferred to a third 
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party which can provide access to the subscriber character- for the sake of clarity. However, the invention is not intended 

ization. The information can also be encrypted to prevent to be limited to the specific terms so selected, and it is to be 

unauthorized access in which case only the subscriber of understood that each specific term includes all technical 

someone authorized by the subscriber can access the data. equivalents which operate in a similar manner to accomplish 

These and other features and objects of the invention will 5 a similar purpose, 

be more fully understood from the following detailed with reference t0 tne drawings, in general, and FIGS. 1 

description of the preferred embodiments which should be through 18 in particular, the apparatus of the present inven- 

read in light of the accompanying drawings. [ion fc disclosed 

BRIEF DESCRIPTION OF THE DRAWINGS The present invention is directed at an apparatus for 
The accompanying drawings, which are incorporated in 10 generating a subscriber profile which contains useful infor- 
and form a part of the specification, illustrate the embodi- mation regarding the subscriber likes and dislikes. Such a 
menis of the present invention and, together with the profile is useful for systems which provide targeted pro- 
description serve to explain the principles of the invention. gramming or advertisements to the subscriber, and allow 
In the drawings: material (programs or advertisements) to be directed at 
FIG. 1 shows a context diagram for a subscriber charac- 15 subscribers who will have a high probability of liking the 
terization system. program or a high degree of interest in purchasing the 

FIG. 2 illustrates a block diagram for a realization of a product, 

subscriber monitoring system for receiving video signals; Since there are typically multiple individuals in a 

FIG. 3 illustrates a block diagram of a channel processor; household, the subscriber characterization may not be a 

FIG. 4 illustrates a block diagram of a computer for a 20 characterization of an individual subscriber but may instead 

realization of the subscriber monitoring system; be a household average. When used herein, the term sub- 

FIG. 5 illustrates a channel sequence and volume over a scriber refers both to an individual subscriber as well as the 

twenty-four (24) hour period; average characteristics of a household of multiple subscrib- 

FIG. 6 illustrates a time of day detailed record; ers. 

FIG. 7 illustrates a household viewing habits statistical 25 i n the present system the programming viewed by the 

table; subscriber, both entertainment and advertisement, can be 

FIG. 8A illustrates an entity-relationship diagram for the studied and processed by the subscriber characterization 

generation of program characteristics vectors; system to determine the program characteristics. This deter- 

FIG. 8B illustrates a flowchart for program characteriza- 30 mination of the program characteristics is referred to as a 

tion; program characteristics vector. The vector may be a truly 

FIG. 9 A illustrates a deterministic program category vec- one-dimensional vector, but can also be represented as an n 

tor; dimensional matrix which can be decomposed into vectors. 

FIG. 9B illustrates a deterministic program sub-category The subscriber profile vector represents a profile of the 

vector; 35 subscriber (or the household of subscribers) and can be in 

FIG. 9C illustrates a deterministic program rating vector; toe form of a demographic profile (average or session) or a 

FIG. 9D illustrates a probabilistic program category vec- P ro S ram or P roduct preference vector. The program and 

tor product preference vectors are considered to be part of a 

FIG. 9E illustrates a probabilistic program sub-category houschold interest profile which can be thought of as an n 

vector . 40 dimensional matrix representing probabilistic measurements 

ni^'ftc-n . . u u-i- .• . . . of subscriber interests. 

FIG. 9F illustrates a probabilistic program content vector; , . . . 

™„ „ n . ... . . . e . . , 7 ... , In the case that the subscriber profile vector is a demo- 

HG. 10A illustrates a set 01 logical heuristic rules: Cl . ai . ,. . 

__ ... , , 6 . . , , . graphic profile, the subscriber profile vector indicates a 

FIG. 10B illustrates a set of heuristic rules expressed in probabilistic measure of the age of the subsC riber or average 

terms of conditional probabilities; ^ agc of (hc vicwcrs in (hc houscholdi ^ of lhc subscri ber, 

FIG. 11 illustrates an entity-relationship diagram for the income range of the subscriber or household, and other such 

generation of program demographic vectors; demographic data. Such information comprises household 

FIG. 12 illustrates a program demographic vector; demographic characteristics and is composed of both aver- 

FIG. 13 illustrates an entity-relationship diagram for the age and session values. Extracting a single set of values from 

generation of household session demographic data and 50 the household demographic characteristics can correspond 

household session interest profiles; to a subscriber profile vector. 

FIG. 14 illustrates an entity-relationship diagram for the The household interest profile can contain both program- 
generation of average and session household demographic ming and product profiles, with programming profiles cor- 
characteristics; responding to probabilistic determinations of what program- 

F1G. 15 illustrates average and session household demo- 55 ming the subscriber (household) is likely to be interested in, 

graphic data; and product profiles corresponding to what products the 

FIG. 16 illustrates an entity-relationship diagram for subscriber (household) is likely to be interested in. These 

generation of a household interest profile; profiles contain both an average value and a session value, 

FIG. 17 illustrates household interest profile including tne average value being a lime average of data, where the 

programming and product profiles; and 60 averaging period may be several days, weeks, months, or the 

FIG. 18 illustrates a client-server architecture, realizing lime between resets of unit, 

the present invention. since a viewing session is likely to be dominated by a 

particular viewer, the session values may, in some 

D ™!:, ED ^C R IPTION OF THE circumstances, correspond most closely to the subscriber 

PREFERRED EMBODIMENTS 65 vaIueSf while the average values raaVt in some 

In describing a preferred embodiment of the invention circumstances, correspond most closely to the household 

illustrated in the drawings, specific terminology will be used values. 
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FIG. 1 depicts the context diagram of a preferred embodi- One of the functions of the SCS 100 is to generate the 

ment of a Subscriber Characterization System (SCS) 100. A program characteristics vectors 150 which are comprised of 

context diagram, in combination with entity-relationship program characteristics data 152, as illustrated in FIG. 1. 

diagrams, provide a basis from which one skilled in the art The program characteristics data 152, which can be used to 

can realize the present invention. The present invention can 5 create the program characteristics vectors 150 both in vector 

be realized in a number of programming languages includ- and table form, are examples of source related information 

ing C, C++, Perl, and Java, although the scope of the which represent characteristics of the source material. In a 

invention is not limited by the choice of a particular pro- preferred embodiment, the program characteristics vectors 

gramming language or tool. Object oriented languages have 150 are lists of values which characterize the programming 

several advantages in terms of construction of the software 10 (source) material in according to the category 144, the 

used to realize the present invention, although the present sub-category 146, and the program description 148. The 

invention can be realized in procedural or other types of present invention may also be applied to advertisements, in 

programming languages known to those skilled in the arl. which case program characteristics vectors contain, as an 

In generating a subscriber profile, the SCS 100 receives example, a product category, a product sub-category, and a 

from a user 120 commands in the form of a volume control 35 brand name. 

signal 124 or program selection data 122 which can be in the As illustrated in FIG. 1, the SCS 100 uses heuristic rules 

form of a channel change but may also be an address request 160. The heuristic rules 160, as described herein, are com- 

which requests the delivery of programming from a network posed of both logical heuristic rules as well as heuristic rules 

address. A record signal 126 indicates that the programming expressed in terms of conditional probabilities. The heuristic 

or the address of the programming is being recorded by the 20 rules 160 can be accessed by the SCS 100 via a request rules 

user. The record signal 126 can also be a printing command, signal 162 which results in the transfer of a copy of rules 164 

a tape recording command, a bookmark command or any and the SCS 100, 

other command intended to store the program being viewed, The SCS 100 forms program demographic vectors 170 

or program address, for later use. from program demographics 172, as illustrated in FIG. 1. 

<>T eTm"ateqaU^^ The program demographic vectors 170 also represent char- 

soia;ce?mj2eaali33QM^^ acteristics of source related information in the form of the 

KaSnliiSESS^^ intended or expected demographics of the audience for 

^tisBt^ofeanalcjrav^ which the source material is intended. 
(TiKlBEiS^ig taU*^ ^ -J Subscriber selection data 110 is obtained from the moni- 

4togjmatej|^ tored activities of the user and in a preferred embodiment 

tVDjlSfimuli^^ J can be stored in a dedicated memory. In an alternate 

^uMSyM y j^ 100 ^^^^ 3 !! ^ ^ ^!™^^ ^ 130 * > embodiment, the subscriber selection data 110 is stored in a 

f ^^l^ ^ipj W^^P^ ^^^y s jfi5^gj^ a |}fe^ ^M^ifl'wM^? 1 ! ' a "-y t^P — *? stora S e disk - Information which is utilized to form the 

sl gnafflffi — ' subscriber selection data 110 includes time 112, which 

^^^^^^^^^^^^/S^^^^t^^^ -herein . 35 corresponds to the time of an event, channel ID 114, 

In a preferred embodiment, the source related text 136 can program ID 116, volume level 118, channel change record 

be extracted from the source material 130 and stored in 119, and program title 117. A detailed record of selection 

memory. The source related text 136, as denned herein, data is illustrated in FIG. 6. 

includes source related textual information including 4Q In a preferred embodiment, a household viewing habits 

descriptive fields which are related to the source material 195 illustrated in FIG. 1 is computed from the subscriber 

130, or text which is part of the source material 130 itself. selection data 110. The SCS 100 transfers household view- 

The source related text 136 can be derived from a number of ing data 197 to form household viewing habits 195. The 

sources including but not limited to closed captioning household viewing data 197 is derived from the subscriber 

information, Electronic Program Guide (EPG) material, and 45 selection data 110 by looking at viewing habits at a particu- 

text information in the source itself (e.g. text in HTML files). lar time of day over an extended period of time, usually 

Electronic Program Guide (EPG) 140 contains informa- several days or weeks, and making some generalizations 
lion related to the source material 130 which is useful to the regarding the viewing habits during that time period, 
user 120. The EPG 140 is typically a navigational tool which The program characteristics vector 150 is derived from 
contains source related information including but not limited 50 the source related text 136 and/or from the EPG 140 by 
to the programming category, program description, rating, applying information retrieval techniques. The details of this 
actors, and duration. The structure and content of EPG data process are discussed in accordance with FIG. 8. 
is described in detail in U.S. Pat. No. 5,596,373 assigned to The program characteristics vector 150 is used in corn- 
Sony Corporation and Sony Electronics which is herein binalion with a set of the heuristic rules 160 to define a set 
incorporated by reference. As shown in FIG. 1, the EPG 140 55 of the program demographic vectors 170 illustrated in FIG. 
can be accessed by the SCS 100 by a request EPG data signal 1 describing the audience the program is intended for. 
142 which results in the return of a category 144, a sub- One output of the SCS 100 is a household profile includ- 
category 146, and a program description 148. mg household demographic characteristics 190 and a house - 

In one embodiment of the present invention, EPG data is hold interest profile 180. The household demographic char- 
accessed and program information such as the category 144, 60 acteristics 190 resulting from the transfer of household 
the sub-category 146, and the program description 148 are demographic data 192, and the household interest profile 
stored in memory. 180, resulting from the transfer of household interests data 

In another embodiment of the present invention, the 182. Both the household demographics characteristics 190 

source related text 136 is the closed captioning text embed- and the household interest profile 180 have a session value 

ded in the analog or digital video signal. Such closed 65 and an average value, as will be discussed herein, 
captioning text can be stored in memory for processing to The monitoring system depicted in FIG. 2 is responsible 

extract the program characteristic vectors 150. for monitoring the subscriber activities, and can be used to 
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In an application of the system for television services, an 
input connector 220 accepts the video signal coming either 
from an antenna, cable television input, or other network. 
The video signal can be analog or Digital MPEG. 
Alternatively, the video source may be a video stream or 
other multimedia stream from a communications network 
including the Internet. 

In the case of either analog or digital video, selected fields 
are defined to carry EPG data or closed captioning text. For 
analog video, the closed captioning text is embedded in the 
vertical blanking interval (VBI). As described in U.S. Pat. 
No. 5,579,005, assigned to Scientific-Atlanta, Inc., the EPG 
information can be carried in a dedicated channel or embed- 
ded in the VBI. For digital video, the closed captioning text 
is carried as video user bits in a user_data field. The EPG 
data is transmitted as ancillary data and is multiplexed at the 
transport layer with the audio and video data. 

Referring to FIG. 2, a system control unit 200 receives 
commands from the user 120, decodes the command and 
forwards the command to the destined module. In a pre- 
ferred embodiment, the commands are entered via a remote 
control to a remote receiver 205 or a set of selection buttons 
207 available at the front panel of the system control unit 
200. In an alternate embodiment, the commands are entered 
by the user 120 via a keyboard. 

The system control unit 200 also contains a Central 
Processing Unit (CPU) 203 for processing and supervising 
all of the operations of the system control unit 200, a Read 
Only Memory (ROM) 202 containing the software and fixed 
data, a Random Access Memory (RAM) 204 for storing 
data. CPU 203, RAM 204, ROM 202, and I/O controller 201 
are attached to a master bus 206. A power supply in a form 
of battery can also be included in the system control unit 200 
for backup in case of power outage. 

An input/output (I/O) controller 201 interfaces the system 
control unit 200 with external devices. In a preferred 
embodiment, the I/O controller 201 interfaces to the remote 
receiver 205 and a selection button such as the channel 
change button on a remote control. In an alternate 
embodiment, it can accept input from a keyboard or a 
mouse. 

The program selection data 122 is forwarded to a channel 
processor 210. The channel processor 210 tunes to a selected 
channel and the media stream is decomposed into its basic 
components: the video stream, the audio stream, and the data 
stream. The video stream is directed to a video processor 
module 230 where it is decoded and further processed for 
display to the TV screen. The audio stream is directed to an 
audio processor 240 for decoding and output to the speakers. 

The data stream can be EPG data, closed captioning text, 
Extended Data Service (EDS) information, a combination of 
these, or an alternate type of data. In the case of EDS the call 
sign, program name and other useful data arc provided. In a 
preferred embodiment, the data stream is stored in a reserved 
location of the RAM 204. In an alternate embodiment, a 
magnetic disk is used for data storage. The system control 
unit 200 writes also in a dedicated memory, which in a 
preferred embodiment is the RAM 204, the selected channel, 
the time 112 of selection, the volume level 118 and the 
program ID 116 and the program title 117. Upon receiving 
the program selection data 122, the new selected channel is 
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directed to the channel processor 210 and the system control 
unit 200 writes to the dedicated memory the channel selec- 
tion end time and the program title 117 at the time 112 of 
channel change. The system control unit 200 keeps track of 
the number of channel changes occurring during the viewing 
time via the channel change record 119. This data forms part 
of the subscriber selection data 110. 

The volume control signal 124 is sent to the audio 
processor 240. In a preferred embodiment, the volume level 
118 selected by the user 120 corresponds to the listening 
volume. In an alternate embodiment, the volume level 118 
selected by the user 120 represents a volume level to another 
piece of equipment such as an audio system (home theatre 
system) or to the television itself. In such a case, the volume 
can be measured directly by a microphone or other audio 
sensing device which can monitor the volume at which the 
selected source material is beings listened. 

A program change occurring while watching a selected 
channel is also logged by the system control unit 200. 
Monitoring the content of the program at the time of the 
program change can be done by reading the content of the 
EDS. The EDS contains information such as program title, 
which is transmitted via the VBI. A change on the program 
title field is detected by the monitoring system and logged as 
an event. In an alternate embodiment, an EPG is present and 
program information can be extracted from the EPG. In a 
preferred embodiment, the programming data received from 
the EDS or EPG permits distinguishing between entertain- 
ment programming and advertisements. 

FIG. 3 shows the block diagram of the channel processor 
210. In a preferred embodiment, the input connector 220 
connects to a tuner 300 which tunes to the selected channel. 
A local oscillator can be used to heterodyne the signal to the 
IF signal. A demodulator 302 demodulates the received 
signal and the output is fed to an FEC decoder 304. The data 
stream received from the FEC decoder 304 is, in a preferred 
embodiment, in an MPEG format. In a preferred 
embodiment, system demultiplexer 306 separates out video 
and audio information for subsequent decompression and 
processing, as well as ancillary data which can contain 
program related information. 

The data stream presented to the system demultiplexer 
306 consists of packets of data including video, audio and 
ancillary data. The system demultiplexer 306 identifies each 
packet from the stream ID and directs the stream to the 
corresponding processor. The video data is directed to the 
video processor module 230 and the audio data is directed to 
the audio processor 240. The ancillary data can contain 
closed captioning text, emergency messages, program guide, 
or other useful information. 

Closed captioning text is considered to be ancillary data 
and is thus contained in the video stream. The system 
demultiplexer 306 accesses the user data field of the video 
stream to extract the closed captioning text. The program 
guide, if present, is carried on data stream identified by a 
specific transport program identifier. 

In an alternate embodiment, analog video can be used. For 
analog programming, ancillary data such as closed caption- 
ing text or EDS data are carried in a vertical blanking 
interval. 

FIG. 4 shows the block diagram of a computer system for 
a realization of the subscriber monitoring system based on 
the reception of multimedia signals from a bi-directional 
network. A system bus 422 transports data amongst the CPU 
203, the RAM 204, Read Only Memory— Basic Input 
Output System (ROM-BIOS) 406 and other components. 
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The CPU 203 accesses a hard drive 400 through a disk 
controller 402. The standard input/output devices are con- 
nected to the system bus 422 through the I/O controller 201. 
A keyboard is attached to the I/O controller 201 through a 
keyboard port 416 and the monitor is connected through a 
monitor port 418. The serial port device uses a serial port 
420 to communicate with the I/O controller 201. Industry 
Standard Architecture (ISA) expansion slots 408 and Periph- 
eral Component Interconnect (PCI) expansion slots 410 
allow additional cards to be placed into the computer. In a 
preferred embodiment, a network card is available to inter- 
face a local area, wide area, or other network. 

FIG. 5 illustrates a channel sequence and volume over a 
twenty-four (24) hour period. The Y-axis represents the 
status of the receiver in terms of on/off status and volume 
level. The X-axis represents the time of day. The channels 
viewed are represented by the windows 501-506, with a first 
channel 502 being watched followed by the viewing of a 
second channel 504, and a third channel 506 in the morning. 
In the evening a fourth channel 501 is watched, a fifth 
channel 503, and a sixth channel 505. A channel change is 
illustrated by a momentary transition to the "off' status and 
a volume change is represented by a change of level on the 
Y-axis. 

A detailed record of the subscriber selection data 110 is 
illustrated in FIG. 6 in a table format. A time column 602 
conlaias the starting lime of every event occurring during 
the viewing time. A Channel ID column 604 lists the 
channels viewed or visited during that period. A program 
title column 603 contains the titles of all programs viewed. 
A volume column 601 contains the volume level 118 at the 
time 112 of viewing a selected channel. 

A representative statistical record corresponding to the 
household viewing habits 195 is illustrated in FIG. 7. In a 
preferred embodiment, a time of day column 700 is orga- 
nized in period of time including morning, mid-day, 
afternoon, night, and late night. In an alternate embodiment, 
smaller time periods are used. A minutes watched column 
702 lists, for each period of lime, the time in minutes in 
which the SCS 100 recorded delivery of programming. The 
number of channel changes during that period and the 
average volume are also included in that table in a channel 
changes column 704 and an average volume column 706 
respectively. The last row of the statistical record contains 
the totals for the items listed in the minutes watched column 
702, the channel changes column 704 and the average 
volume 706. 

FIG. 8A illustrates an entity-relationship diagram for the 
generation of the program characteristics vector 150. The 
context vector generation and retrieval technique described 
in U.S. Pat. No. 5,619,709, which is incorporated herein by 
reference, can be applied for the generation of the program 
characteristics vectors 150. Other techniques are well known 
by those skilled in the art. 

Referring to FIG. 8 A, the source material 130 or the EPG 
140 are passed through a program characterization process 
800 to generate the program characteristics vectors 150. The 
program characterization process 800 is described in accor- 
dance with FIG. 8B. Program content descriptors including 
a first program content descriptor 802, a second program 
content descriptor 804 and an nth program content descrip- 
tor 806, each classified in terms of the category 144, the 
sub-category 146, and other divisions as identified in the 
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expected content of material found in the particular program 
category 144. In this example, the program content descrip- 
tors 802, 804 and 806 would conlain text representative of 
what would be found in programs in the news, fiction, and 
advertising categories respectively. The context vector gen- 
erator 820 generates context vectors for that set of sample 
texts resulting in a .first summary context vector 808, a 
second summary context vector 810, and an nth summary 
context vector 812. In the example given, the summary 
context vectors 808, 810, and 812 correspond to the catego- 
ries of news, fiction and advertising respectively. The sum- 
mary vectors are stored in a local data storage system. 

Referring to FIG. 8B, a sample of the source related text 
136 which is associated with the new program to be clas- 
sified is passed to the context vector generator 820 which 
generates a program context vector 840 for that program. 
The source related text 136 can be either the source material 
130, the EPG 140, or other text associated with the source 
material. A comparison is made between the actual program 
context vectors and the stored program content context 
vectors by computing, in a dot product computation process 
830, the dot product of the first summary context vector 808 
with the program context vector 840 to produce a first dot 
product 814. Similar operations are performed to produce 
second dot product 816 and nth dot product 818. 

The values contained in the dot products 814, 816 and 
818, while not probabilistic in nature, can be expressed in 
probabilistic terms using a simple transformation in which 
the result represents a confidence level of assigning the 
corresponding content to that program. The transformed 
values add up to one. The dot products can be used to 
classify a program, or form a weighted sum of classifications 
which results in the program characteristics vectors 150. In 
the example given, if the source related text 136 was from 
an advertisement, the nth dot product 818 would have a high 
value, indicating that the advertising category was the most 
appropriate category, and assigning a high probability value 
to that category. If the dot products corresponding to the 
other categories were significantly higher than zero, those 
categories would be assigned a value, with the result being 
the program characteristics vectors 150 as shown in FIG. 
9D. 

For the sub-categories, probabilities obtained from the 
content pertaining to the same sub-category 146 are summed 
to form the probability for the new program being in that 
sub-category 146. At the sub-category level, the same 
method is applied to compute the probability of a program 
being from the given category 144. The three levels of the 
program classification system; the category 144, the sub- 
category 146 and the content, are used by the program 
characterization process 800 to form the program charac- 
teristics vectors 150 which are depicted in FIGS. 9D-9F. 

The program characteristics vectors 150 in general are 
represented in FIGS. 9A through 9F. FIGS. 9A, 9B and 9C 
are an example of deterministic program vectors. This set of 
vectors is generated when the program characteristics arc 
well defined, as can occur when the source related text 136 
or the EPG 140 contains specific fields identifying the 
category 144 and the sub-category 146. A program rating 
can also provided by the EPG 140. 

In the case that these characteristics are not specified, a 
statistical set of vectors is generated from the process 
described in accordance with FIG. 8. FIG. 9D shows the 



industry accepted program classification system, are pre- 65 probability that a program being watched is from the given 
sented to a context vector generator 820. As an example, the category 144. The categories are listed in the X-axis. The 
program content descriptor can be text representative of the sub-category 146 is also expressed in terms of probability. 
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This is shown in FIG. 9E. The content component of this set 
of vectors is a third possible level of the program 
classification, and is illustrated in FIG. 9F. 

FIG. 10A illustrates sets of logical heuristics rules which 
form part of the heuristic rules 160. In a preferred 5 
embodiment, logical heuristic rules are obtained from socio- 
logical or psychological studies. Two types of rules are 
illustrated in FIG. 10A. The first type links an individual's 
viewing characteristics to demographic characteristics such 
as gender, age, and income level. A channel change rate rule 10 
1030 attempts to determine gender based on channel change 
rate. An income related channel change rate rule 1010 
attempts to link channel change rates to income brackets. A 
second type of rules links particular programs to particular 
audience, as illustrated by a gender determining rule 1050 15 
which links the program category 144/sub-category 146 
with a gender. The result of the application of the logical 
heuristic rules illustrated in FIG. 10A are probabilistic 
determinations of factors including gender, age, and income 
level. Although a specific set of logical heuristic rules has 20 
been used as an example, a wide number of types of logical 
heuristic rules can be used to realize the present invention. 
In addition, these rules can be changed based on learning 
within the system or based on external studies which provide 
more accurate rules. 25 

FIG. 10B illustrates a set of the heuristic rules 160 
expressed in terms of conditional probabilities. In the 
example shown in FIG. 10B, the category 144 has associated 
with it conditional probabilities for demographic factors 
such as age, income, family size and gender composition. 30 
The category 144 has associated with it conditional prob- 
abilities that represent probability that the viewing group is 
within a certain age group dependent on the probability that 
they are viewing a program in that category 144. 

FIG. 11 illustrates an entity-relationship diagram for the 35 
generation of the program demographic vectors 170. In a 
preferred embodiment, the heuristic rules 160 are applied 
along with the program characteristic vectors 150 in a 
program target analysis process 1100 to form the program 
demographic vectors 170. The program characteristic vec- 40 
tors 150 indicate a particular aspect of a program, such as its 
violence level. The heuristic rules 160 indicate that a par- 
ticular demographic group has a preference for that program. 
As an example, it may be the case that young males have a 
higher preference for violent programs than other sectors of 45 
the population. Thus, a program which has the program 
characteristic vectors 150 indicating a high probability of 
having violent content, when combined with the heuristic 
rules 160 indicating that "young males like violent 
programs," will result, through the program target analysis 50 
process 1100, in the program demographic vectors 170 
which indicate that there is a high probability that the 
program is being watched by a young male. 

The program target analysis process 1100 can be realized 
using software programmed in a variety of languages which 55 
processes mathematically the heuristic rules 160 to derive 
the program demographic vectors 170. The table represen- 
tation of the heuristic rules 160 illustrated in FIG. 10B 
expresses the probability that the individual or household is than one unless only one program has been viewed) and 
from a specific demographic group based on a program with 60 summed with the previous value. Time duration data, along 



Expressing the probability that a program is destined to a 
specific demographic group can be determined by applying 
Bayes rule. This probability is the sum of the conditional 
probabilities that the demographic group likes the program, 
conditional to the category 144 weighted by the probability 
that the program is from that category 144. In a preferred 
embodiment, the program target analysis can calculate the 
program demographic wectors by application of logical 
heuristic rules, as illustrated in FIG. 10A, and by application 
of heuristic rules expressed as conditional probabilities as 
shown in FIG. 10B. Logical heuristic rules can be applied 
using logical programming and fuzzy logic using techniques 
well understood by those skilled in the art, and are discussed 
in the text by S. V. Kartalopoulos entitled "Understanding 
Neural Networks and Fuzzy Logic" which is incorporated 
herein by reference. 

Conditional probabilities can be applied by simple math- 
ematical operations multiplying program context vectors by 
matrices of conditional probabilities. By performing this 
process over all the demographic groups, the program target 
analysis process 1100 can measure bow likely a program is 
to be of interest to each demographic group. Those prob- 
abilities values form the program demographic vector 170 
represented in FIG. 12. 

As an example, the heuristic rules expressed as condi- 
tional probabilities shown in FIG. 10 B are used as part of a 
matrix multiplication in which the program characteristics 
vector 150 of dimension N, such as those shown in FIGS. 
9A-9F is multiplied by an NxM matrix of heuristic rules 
expressed as conditional probabilities, such as that shown in 
FIG. 10B. The resulting vector of dimension M is a weighted 
average of the conditional probabilities for each category 
and represents the household demographic characteristics 
190. Similar processing can be performed at the sub- 
category and content levels. 

FIG. 12 illustrates an example of the program demo- 
graphic vector 170, and shows the extent to which a par- 
ticular program is destined to a particular audience. This is 
measured in terms of probability as depicted in FIG. 12. The 
Y-axis is the probability of appealing to the demographic 
group identified on the X-axis. 

FIG. 13 illustrates an entity -relationship diagram for the 
generation of household session demographic data 1310 and 
household session interest profile 1320. In a preferred 
embodiment, the subscriber selection data 110 is used along 
with the program characteristics vectors 150 in a session 
characterization process 1300 to generate the household 
session interest profile 1320. The subscriber selection data 
110 indicates what the subscriber is watching, for how long 
and at what volume they are watching the program. 

^Xpreferred .emb^dimentpth c-sessionc haracterizatioai 
process 1300 forms a weighted average of "the program 
characteristics vectors 150 in which the lime duration the 
program is watched is normalized to the session time 
(typically defined as the time from which the unit was turned 
on to the present). The program characteristics vectors 150 
are multiplied by the normalized time duration (which is less 



a particular category 144. This can be expressed, using 
probability terms as follow "the probability that the indi- 
viduals are in a given demographic group conditional to the 
program being in a given category". Referring to FIG. 9D, 
the probability that the group has certain demographic 
characteristics based on the program being in a specific 
category is illustrated. 
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with other subscriber viewing information, is available from 
the subscriber selection data 110. The resulting weighted 
average of program characteristics vectors forms the house- 
hold session interest profile 1320, with each program con- 
tributing to the household session interest profile 1320 
according to how long it was watched. The household 
session interest profile 1320 is normalized to produce proba- 
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bilistic values of the household programming interests dur- 
ing that session. 

In an alternate embodiment, the heuristic rules 160 are 
applied to both the subscriber selection data 110 and the 
program characteristics vectors 150 to generate the house- 5 
hold session demographic data 1310 and the household 
session interest profile 1320. In this embodiment, weighted 
averages of the program characteristics vectors 150 are 
formed based on the subscriber selection data 110, and the 
heuristic rules 160 are applied. In the case of logical 10 
heuristic rules as shown in FIG. 10 A, logical programming 
can be applied to make determinations regarding the house- 
hold session demographic data 1310 and the household 
session interest profile 1320. In the case of heuristic rules in 
the form of conditional probabilities such as those illustrated 15 
in FIG. 10B, a dot product of the time averaged values of the 
program characteristics vectors can be taken with the appro- 
priate matrix of heuristic rules to generate both the house- 
hold session demographic data 1310 and the household 
session interest profile 1320. 20 

Volume control measurements which form part of the 
subscriber selection data 110 can also be applied in the 
session characterization process 1300 to form a household 
session interest profile 1320. This can be accomplished by 
using normalized volume measurements in a weighted aver- 25 
age manner similar to how time duration is used. Thus, 
muting a show results in a zero value for volume, and the 
program characteristics vector 150 for this show will not be 
averaged into the household session interest profile 1320. 

FIG. 14 illustrates an entity -relationship diagram for the 
generation of average household demographic characteris- 
tics and session household demographic characteristics 190. 
A household demographic characterization process 1400 
generates the household demographic characteristics 190 35 
represented in table format in FIG. 15. The household 
demographic characterization process 1400 uses the house- 
hold viewing habit 195 in combination with the heuristic 
rules 160 to determine demographic data. For example, a 
household with a number of minutes watched of zero during 4Q 
the day may indicate a household with two working adults. 
Both logical heuristic rules as well as rules based on 
conditional probabilities can be applied to the household 
viewing habits 195 to obtain the household demographics 
characteristics 190. 

45 

The household viewing habits 195 is also used by the 
system to detect out-of-habits events. For example, if a 
household with a zero value for the minutes watched column 
702 at late night presents a session value at that time via the 
household session demographic data 1310, this session will 50 
be characterized as an out-of-habits event and the system 
can exclude such data from the average if it is highly 
probable that the demographics for that session are greatly 
different than the average demographics for the household. 
Nevertheless, the results of the application of the household 55 
demographic characterization process 1400 to the household 
session demographic data 1310 can result in valuable ses- 
sion demographic data, even if such data is not added to the 
average demographic characterization of the household. 

FIG. 15 illustrates the average and session household 60 
demographic characteristics. A household demographic 
parameters column 1501 is followed by an average value 
column 1505, a session value column 1503, and an update 
column 1507. The average value column 1505 and the 
session value column 1503 are derived from the household 65 
demographic characterization process 1400. The determin- 
istic parameters such as address and telephone numbers can 



be obtained from an outside source or can be loaded into the 
system by the subscriber or a network operator at the time 
of installation. Updating of deterministic values is prevented 
by indicating that these values should not be updated in the 
update column 1507. 

FIG. 16 illustrates an entity-relationship diagram for the 
generation of the household interest profile 180 in a house- 
hold interest profile generation process 1600. In a preferred 
embodiment, the household interest profile generation pro- 
cess comprises averaging the household session interest 
profile 1320 over multiple sessions and applying the house- 
hold viewing habits 195 in combination with the heuristic 
rules 160 to form the household interest profile 180 which 
takes into account both the viewing preferences of the 
household as well as assumptions about households/ 
subscribers with those viewing habits and program prefer- 
ences. 

FIG. 17 illustrates the household interest profile 180 
which is composed of a programming types row 1709, a 
products types row 1707, and a household interests column 
1701, an average value column 1703, and a session value 
column 1705. 

The product types row 1707 gives an indication as to what 
type of advertisement the household would be interested in 
watching, thus indicating what types of products could 
potentially be advertised with a high probability of the 
advertisement being watched in its entirety. The program- 
ming types row 1709 suggests what kind of programming 
the household is likely to be interested in watching. The 
household interests column 1701 specifies the types of 
programming and products which arc statistically charac- 
terized for that household. 

As an example of the industrial applicability of the 
invention, a household will perform its normal viewing 
routine without being requested to answer specific questions 
regarding likes and dislikes. Children may watch television 
in the morning in the household, and may change channels 
during commercials, or not at all. The television may remain 
off during the working day, while the children are at school 
and day care, and be turned on again in the evening, at which 
lime the parents may "surf* channels, mute the television 
during commercials, and ultimately watch one or two hours 
of broadcast programming. The present invention provides 
the ability to characterize the household, and may make the 
determination that there are children and adults in the 
household, with program and product interests indicated in 
the household interest profile 180 corresponding to a family 
of that composition. Ahousehold with two retired adults will 
have a completely different characterization which will be 
indicated in the household interest profile 180. 

Although the present invention has been largely described 
in the context of a single computing platform receiving 
programming, the SCS 100 can be realized as part of a 
client-server architecture, as illustrated in FIG. 18. Referring 
to FIG. 18, residence 1800 contains a personal computer 
(PC) 1820 as well as the combination of a television 1810 
and a set-top 1808, which can request and receive program- 
ming. The equipment in residence 1800, or similar equip- 
ment in a small or large business environment, forms the 
client side of the network as defined herein. Programming is 
delivered over an access network 1830, which may be a 
cable television network, telephone type network, or other 
access network. Information requests are made by the client 
side to a server 1840 which forms the server side of the 
network. Server 1840 has content locally which it provides 
to the subscriber, or requests content on behalf of the 
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subscriber from a ihird party content provider I860, as 
illustrated in FIG. 18. Requests made on behalf of the client 
side by server 1840 are made across a wide area network 
1850 which can be the Internet or other public or private 
network. Techniques for making requests on behalf of a 
client are frequently referred to a proxy techniques and are 
well known to those skilled in the art. The server side 
receives the requested programming which is displayed on 
PC 1820 or television 1810 according to which device made 
the request. 

In a preferred embodiment the server 1840 maintains the 
subscriber selection data 110 which it is able to compile 
based on its operation as a proxy for the client side. Retrieval 
of source related information and the program target analy- 
sis process 1100, the program characterization process 800, 
the program target analysis process 1100, the session char- 
acterization process 1300, the household demographic char- 
acterization process 1400, and the household interest profile 
generation process 1600 can be performed by server 1840. 

As an example of the industrial applicability of the 
invention, a household will perform its normal viewing 
routine without being requested to answer specific questions 
regarding likes and dislikes. Children may watch television 
in the morning in the household, and may change channels 
during commercials, or not at all. The television may remain 
off during the working day, while the children are at school 
and day care, and be turned on again in the evening, at which 
time the parents may "surf" channels, mute the television 
during commercials, and ultimately watch one or two hours 
of broadcast programming. The present invention provides 
the ability to characterize the household, and may make the 
determination that there are children and adults in the 
household, with program and product interests indicated in 
the household interest profile 180 corresponding to a family 
of that composition. A household with two retired adults will 
have a completely different characterization which will be 
indicated in household interest profile 180, which is stored 
at server 1840 and can be accessed by various groups 
wishing to provide programming and advertisements to the 
members of residence 1800. 

Although this invention has been illustrated by reference 
to specific embodiments, it will be apparent to those skilled 
in the art that various changes and modifications may be 
made which clearly fall within the scope of the invention. 
The invention is intended to be protected broadly within the 
spirit and scope of the appended claims. 

What is claimed is: 

1. A method for generating a subscriber profile, the 
method comprising: 

monitoring subscriber interactions with a television; 

processing the subscriber interactions to generate viewing 
characteristics that identify viewing traits associated 
with the subscriber; 

retrieving heuristic rules associated with the viewing 
characteristics, wherein the heuristic rules predict traits 
about the subscriber not related to the viewing charac- 
teristics; and 

generating the subscriber profile by applying the heuristic 
rules to the viewing characteristics. 

2. The method of claim 1, further comprising storing the 
subscriber profile. 

3. The method of claim 1, wherein the heuristic rules are 
probabilistic in nature. 

4. The method of claim 1, wherein the subscriber profile 
is probabilistic in nature. 

5. The method of claim 1, wherein the heuristic rules 
predict demographic traits about the subscriber. 
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6. The method of claim 1, wherein the subscriber profile 
identifies demographic traits of the subscriber. 

7. The method of claim 1, wherein the heuristic rules 
predict product interest traits about the subscriber. 

8. The method of claim 1, wherein the subscriber profile 
identifies product interest traits of the subscriber. 

9. The method of claim 1, wherein said monitoring 
includes monitoring at least some subset of channel changes, 
volume changes, record commands, and associated time. 

10. The method of claim 1, wherein said processing 
includes aggregating the subscriber interactions to generate 
the viewing characteristics. 

11. The method of claim 10, wherein said aggregating 
includes aggregating the subscriber interactions for a single 
viewing session to generate session viewing characteristics 
that identify viewing traits associated with the subscriber for 
that viewing session. 

12. The method of claim 11, wherein 

said retrieving includes retrieving heuristic rules associ- 
ated with the session viewing characteristics; and 

said generating includes generating a session subscriber 
profile by applying the heuristic rules to the session 
viewing characteristics. 

13. The method of claim 10, wherein said aggregating 
includes aggregating the subscriber interactions for multiple 
viewing sessions to generate average viewing characteristics 
that identify viewing traits associated with the subscriber Tor 
the multiple viewing sessions. 

14. The method of claim 13, wherein 

said retrieving includes retrieving heuristic rules associ- 
ated with the average viewing characteristics; and 

said generating includes generating an average subscriber 
profile by applying the heuristic rules to the average 
viewing characteristics. 

15. The method of claim 1, wherein the viewing charac- 
teristics do not identify raw subscriber interaction data. 

16. The method of claim 1, wherein the viewing charac- 
teristics include at least some subset of 

viewing time per channel, category, and network; 

channel changes per time period; 

average volume per time period, channel, category, and 

network; and 
dwell time per channel, category, and network. 

17. The method of claim 1, further comprising retrieving 
source related information corresponding to requested 
source material, wherein the source related information 
includes at least some subset of title, category, subcategory, 
description, rating, and content. 

18. The method of claim 17, wherein said processing 
includes processing the subscriber interactions with respect 
to the source related information. 

19. A method for generating a subscriber profile for a 
subscriber, the method comprising: 

monitoring subscriber viewing activities including 

requests for source material; 
retrieving source related information corresponding to 

requested source material, wherein the source related 

information includes descriptions of at least one aspect 

of the requested source material; 
creating a first representation of the subscriber based on 

the subscriber viewing activities and the source related 

information; 

retrieving a set of rules associated with at least a subset of 
information contained in the subscriber viewing activi- 
ties and the source related information, wherein the set 
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of rules relates at least one aspect of the subscriber 
viewing activities or at least one aspect of the source 
related information to at least one non -viewing param- 
eter; and 

processing the subscriber viewing activities with respect 
to the source related information and the set of rules to 
generate the subscriber profile, wherein the subscriber 
profile defines at least a second representation of the 
subscriber. 

20. The method of claim 19, wherein the set of rules 
define probabilities associated with the at least one non- 
viewing parameter and the subscriber profile is probabilistic 
in nature. 

21. The method of claim 19, wherein at least one of the 
at least one non- vie wing parameters is a demographic 
parameter and the subscriber profile defines at least a 
demographic representation of the subscriber. 

22. The method of claim 19, wherein at least one of the 
at least one non-viewing parameters is a product interest 
parameter and the subscriber profile defines at least a 
product interest representation of the subscriber. 

23. A method for generating a subscriber profile for a 
subscriber, the method comprising: 

monitoring subscriber television viewing activities; 

retrieving source related information corresponding to 
requested source material, wherein the source related 
information includes at least one description of the 
requested source material; 

creating viewing characteristics based on the subscriber 
viewing activities and the source related information, 
wherein the viewing characteristics define a first rep- 
resentation of the subscriber; 

retrieving a set of rules associated with at least some 
subset of the viewing characteristics, wherein the set of 
rules defines at least one aspect of the subscriber not 
defined in the viewing characteristics; and 

generating the subscriber profile by applying the set of 
rules to the viewing characteristics, wherein the sub- 
scriber profile defines a second representation of the 
subscriber. 

24. A system for generating a subscriber profile, the 
system comprising: 

means for monitoring subscriber interactions with a tele- 
vision; 

means for processing the subscriber interactions to gen- 
erate viewing characteristics that identify viewing traits 
associated with the subscriber; 

means for retrieving heuristic rules associated with the 
viewing characteristics, wherein the heuristic rules 
predict traits about the subscriber not related to the 
viewing characteristics; and 

means for generating the subscriber profile by applying 
the heuristic rules to the viewing characteristics. 

25. The system of claim 24, wherein said means for 
processing aggregates the subscriber interactions to generate 
the viewing characteristics. 

26. The system of claim 24, further comprising means for 
retrieving source related information corresponding to 
requested source material, wherein the source related infor- 
mation includes descriptions of at least one aspect of the 
requested source material. 

27. The system of claim 26, wherein said means for 
processing processes the subscriber interactions with respect 
to the source related information. 

28. A system for generating a subscriber profile for a 
subscriber, the method comprising: 
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means for monitoring subscriber viewing activities 
including requests for source material; 

means for retrieving source related information corre- 
sponding to requested source material, wherein the 
source related information describes at least one aspect 
of the requested source material; 

means for creating viewing characteristics based on the 
subscriber viewing activities and the source related 
information, wherein the viewing characteristics define 
a first representation of the subscriber; 

means for retrieving a set of rules associated with at least 
some subset of the viewing characteristics, wherein the 
set of rules defines at least one aspect of the subscriber 
not defined in the viewing characteristics; and 

means for generating the subscriber profile by applying 
the set of rules to the viewing characteristics, wherein 
the subscriber profile defines a second representation of 
the subscriber. 

29. The system of claim 28, wherein 

said means for retrieving a set of rules retrieves a set of 
rules that define probabilities associated with the at 
least one aspect of the subscriber; and 

said means for generating the subscriber profile generates 
the subscriber profile as a probabilistic determination of 
the at least one aspect of the subscriber not defined in 
the viewing characteristics. 

30. The system claim 28, wherein 

said means for retrieving a set of rules retrieves a set of 
rules that define demographic parameters of the sub- 
scriber; and 

said means for generating the subscriber profile generates 
the subscriber profile as a demographic representation 
of the subscriber. 

31. The system claim 28, wherein 

said means for retrieving a set of rules retrieves a set of 
rules that define product interests of the subscriber; and 

said means for generating the subscriber profile generates 
the subscriber profile identifying product interests of 
the subscriber. 

32. A method for generating a subscriber profile, the 
method comprising: 

monitoring subscriber interactions with a television; 

processing the subscriber interactions in order to charac- 
terize interaction traits associated with the subscriber; 

retrieving heuristic rules associated with the interaction 
traits, wherein the heuristic rules predict non- 
interaction traits about the subscriber, the non- 
interaction traits not being limited to child/adult dis- 
tinctions; and 

generating the subscriber profile by applying the heuristic 
rules to the interaction traits. 

33. The method of claim 32, wherein said generating the 
subscriber profile includes generating the subscriber profile 
having at least one non-interaction trait that is not child/adult 
distinction related. 

34. The method of claim 32, wherein the non-interaction 
traits are demographic traits. 

35. The method of claim 32, wherein the heuristic rules 
associate the interaction traits to the non-interaction traits. 

36. The method of claim 35, wherein interaction traits to 
non-interaction traits associations include at least some 
subset of 

channel change speed to gender, income level or a com- 
bination thereof; 
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programs is to gender, work status, or a combination 
thereof; and 

program categories, program sub -categories, program 
content or some combination thereof to gender, age, 
income level, family size, or some combination thereof. 5 

37. The method of claim 32, wherein the heuristic rules 
assign probabilities for the subscriber having certain non- 
interaction traits associated therewith. 

38. The method of claim 37, wherein the non-interaction 
traits having probabilities assigned thereto include at least 10 
some subset of age, income, family size and gender. 

39. The method of claim 37, wherein each of the non- 
interaction traits are divided into categories and each cat- 
egory is assigned a probability. 

40. The method of claim 32, wherein the subscriber 15 
profile assigns probabilities associated with the subscriber 
having certain non-interaction traits. 

41. The method of claim 40, wherein the each of the 
non-interaction traits of the subscriber profile are divided 
into subcategories and each category is assigned a probabil- 20 
ity. 

42. The method of claim 32, wherein the non-interaction 
traits are product interests. 

43. The method of claim 42, wherein the product interests 
are not limited to products associated with children and 25 
products associated with adults. 

44. The method of claim 32, wherein the subscriber 
profile may represent a single subscriber or a household of 
subscribers. 

45. The method of claim 32, wherein the subscribers 30 
identity is not known. 

46. A method for generating a subscriber profile, the 
method comprising: 

processing subscriber interactions with a television in 
order to characterize interaction traits associated with 35 
the subscriber; 
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retrieving heuristic rules associated with the interaction 
traits, wherein the heuristic rules predict non- 
interaction trails about the subscriber; and 

generating the subscriber profile by applying the heuristic 
rules to the interaction traits, wherein the subscriber 
profile is not limited to child adult distinctions, adver- 
tising associated with children or advertising associated 
with adults. 

47. A method for generating a subscriber profile, the 
method comprising: 

accessing interaction traits defining how a subscriber 
interacts with a television; 

retrieving heuristic rules associated with the interaction 
traits, wherein the heuristic rules predict non- 
interaction traits about the subscriber, the non- 
interaction traits including at least some subset of 
gender, income level and family size; and 

generating the subscriber profile by applying the heuristic 
rules to the interaction traits. 

48. A method for generating a subscriber profile, the 
method comprising: 

characterizing subscriber interactions with a television as 
interaction traits; 

retrieving heuristic rules associated with the interaction 
traits, wherein the heuristic rules associate the interac- 
tion traits to non-interaction traits, the interaction traits 
to non-interaction traits associations including at least 
some subset of channel change speed to gender, chan- 
nel change speed to income level, program genre to 
gender, program genre to income level and program 
genre to family size; and 

generating the subscriber profile by applying the heuristic 
rules to the interaction trails. 

***** 
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